Regular Language Distance and Entropy
نویسندگان
چکیده
This paper addresses the problem of determining the distance between two regular languages. It will show how to expand Jaccard distance, which works on finite sets, to potentially-infinite regular languages. The entropy of a regular language plays a large role in the extension. Much of the paper is spent investigating the entropy of a regular language. This includes addressing issues that have required previous authors to rely on the upper limit of Shannon’s traditional formulation of entropy, because its limit does not always exist [2, 6, 13]. The paper also includes proposing a new limit based formulation for the entropy of a regular language and proves that formulation to both exist and be equivalent to Shannon’s original formulation (when it exists). Additionally, the proposed formulation is shown to equal an analogous but formally quite different notion of topological entropy from Symbolic Dynamics – consequently also showing Shannon’s original formulation to be equivalent to topological entropy. Surprisingly, the natural Jaccard-like entropy distance is trivial in most cases. Instead, the entropy sum distance metric is suggested, and shown to be granular in certain situations.
منابع مشابه
On the Relationship between Complexity and Entropy for Markov Chains and Regular Languages
Using the past-future mutual information as a measure of complexity, the relation between the complexity and the Shannon entropy is determined analytically for sequences generated by Markov chains and regular languages. It is emphasized that, given an entropy value, there are many possible complexity values, and vice versa; that is, the relationship between complexity and entropy is not one-to-...
متن کاملTopological Entropy of Formal Languages
In this workwe shall consider a notion of complexity of formal languages L that is inspired by the concept of entropy from dynamical systems. More precisely, we shall define the topological entropy of L to be the exponential growth-rate of the restrictions of the Nerode congruence relation of L to words of length at most n. We shall show that the topological entropy of regular languages is alwa...
متن کاملAn Analysis of Ministry of Education’s Strategic Plans Based on Favorable Components of English Language Teaching Using Shannon’s Entropy
The present research aims to analyze the content of Ministry of Education’s strategic plans (the Fundamental Reform Document of Education, the Comprehensive National Scientific Plan and the National Curriculum Document) based on Shannon's entropy regarding the favorable components of teaching English. The contents of the Fundamental Reform Document of Education, the Comprehensive National Scien...
متن کاملTitle: Content analysis of English high school textbooks based on Shannon entropy method
Introduction: English language teaching curriculum is very important in effective teaching and learning of students. In order to pay attention to the importance of teaching English as one of the most important communication tools, it is necessary to develop a curriculum that can accommodate all the necessary English language teaching needs. Therefore, the purpose of this study is to analyze t...
متن کاملLanguage Complexity of Unimodal Systems
The complexity of formal languages which are generated by S-unimodal systems on interval covers is studied. It is shown that there exist S-unimodal systems with nonrecursive languages, those with recursive but not context-sensitive languages, and those with context-sensitive but not regular languages. It is also shown that S-unimodal systems with regular languages include all systems with finit...
متن کامل